CDS
Accession Number | TCMCG075C10926 |
gbkey | CDS |
Protein Id | XP_007039794.2 |
Location | complement(join(32527613..32527777,32528139..32528261,32528338..32528452,32528587..32528685,32528762..32528814,32529428..32529584,32530044..32530102,32530270..32530337,32530423..32530566,32531239..32531302,32532079..32532129,32532232..32532371,32532480..32532619,32533059..32533144,32533231..32533376,32533502..32533574,32534300..32534375,32534615..32534722,32534874..32534920,32535143..32535188,32535998..32536110,32536207..32536242,32536340..32536543,32537140..32537205)) |
Gene | LOC18606235 |
GeneID | 18606235 |
Organism | Theobroma cacao |
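
The Location field above uses the GenBank feature-location syntax: `join(...)` lists the exon intervals (1-based, inclusive) and `complement(...)` indicates that the feature lies on the reverse strand. As a minimal sketch, the string can be parsed and the spliced CDS length checked against the protein length. The function names `parse_location` and `spliced_length` are hypothetical helpers introduced here for illustration, and the parser only handles simple `start..end` intervals (no partial-end markers such as `<` or `>`).

```python
import re

# Location string copied from the record's Location field.
LOCATION = (
    "complement(join(32527613..32527777,32528139..32528261,32528338..32528452,"
    "32528587..32528685,32528762..32528814,32529428..32529584,32530044..32530102,"
    "32530270..32530337,32530423..32530566,32531239..32531302,32532079..32532129,"
    "32532232..32532371,32532480..32532619,32533059..32533144,32533231..32533376,"
    "32533502..32533574,32534300..32534375,32534615..32534722,32534874..32534920,"
    "32535143..32535188,32535998..32536110,32536207..32536242,32536340..32536543,"
    "32537140..32537205))"
)


def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple GenBank-style location.

    Hypothetical helper: assumes plain 'start..end' intervals only.
    """
    strand = "-" if location.strip().startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def spliced_length(segments):
    """Sum of interval lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


strand, exons = parse_location(LOCATION)
cds_length = spliced_length(exons)
print(strand, len(exons), cds_length)
```

For this record the intervals sum to 2379 nt, which equals 3 × (792 + 1): the 792 codons of the protein plus one stop codon.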
Protein
Length | 792 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007039732.2 |
Definition | PREDICTED: DNA mismatch repair protein MSH4 isoform X6 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGAAGACGACGGAGGAGAGAGGTCAAGCTTCGTGATCGGTCTCATCGAGAACAGAGCTAAAGAGGTTGGAGTGGCTGCCTTTGACTTAAGATCAGCTTCTTTGCATCTTTCTCAATACATTGAAACCAGCAGCTCATATCAGAATACAAAAACTTTGCTTCATTTCTATGATCCCATGATGATCATTGTTCCTCCAAACAAACTGGCTCCTGAAGGTATGGTGGGAGTATCAGAACTAGTAGATCGGTTTTATGCTTCAGTCAAGAAGATTGTCATGGCTCGTGGTTGCTTTGATGACACCAAGGGTGCAATGCTGATTAAAAATTTAGCTGTCAGAGAGCCTTCAGCCCTTGGTTTGGATAGTTACTACAAACAGTATTATCTTTGCTTGGCTTCTGCTTCTGCTACAATCAAATGGATAGAAGCAGAGAAAGGTGTTATTGTCACAAATCATTCCTTATCGGTTACTTTTAATGGATCATTTGACCACATGAACATTGATGCTACTAGTGTCCAAAACTTAGAAATTATTGAACCTTTTCATTCTGCACTTTGGGGCACAAACAACAAGAAAAGAAGTCTATTCCACATGCTTAAGACAACAAAAACTGTTGGAGGGACTAGACTTCTTCGTGCCAATCTTTTGCAGCCTTTAAAAGATATCGAAACTATCAATACGCGTCTGGATTGCCTGGATGAGTTGATGAGCAATGAACAGCTATTCTTTGGACTGTCTCAGGTCTTGCGAAAGTTCCCAAAGGAGACTGATAGGGTACTTTGTCATTTCTGCTTCAAGCCAAAGAAAGTAACAAATGAAGTCTTGGTTGTGGAAAACACTAGAAAGAGCCAAATGCTGATATCAAGCATCATTCTTCTCAAAACTGCATTAGATGCCTTGCCGTTACTATCAAAGGTGCTTAAGGATGCAAAAAGTTTTCTTCTTGCAAATGTTTACAAGTCTATATGTGAAAACGAGAAATATGCTGACATTAGAAAGAGAATTGGAGTGGTGATTGATGAAGATGTGCTTCACGCACGGGTTCCTTTTGTTGCCCGCACACAGCAGTGTTTTGCTGTCAAGGCTGGCATTGATGGGCTATTGGATATAGCTCGGAGATCTTTTTGTGATACCAGCGAAGCTATACATAACCTTGCAAACAAGTACCGGGAAGAATTCAAGATGCCGAATCTGAAACTCCCATTTAACAGTAGACAAGGTTTTTACTTTAGCATTCCACAGAAAGACATTCAGGGACAGCTTCCCAGCAAGTTCATTCAGGTTGTGAAACATGGGAATAATGTACATTGTTCAACTTTGGAACTTGCTTCTCTGAATGTCAGAAATAAATCTGCGGCTGGAGAGTGTTATATACGAACAGAAGTTTGCTTGGAAGCCCTAGTTGATACCATAAGGGAGGATATCTCTGTGCTCACACTGCTTGCTGAAGTCCTGTGCCTGTTAGATATGATTGTTAATTCATTTTCTCATACAATATCAACCAAGCCTGTTGACCGATATATTAGGCCAGAATTTACTGATGATGGCCCTCTGGCAATTGATGCTGGTAGACACCCCATCCTAGAAAGCATACACTGTGATTTTGTGCCCAACAACATCTTTATTTCAGAAGCATCAAACATGGTTATTGCAATGGGGCCAAACATGAGCGGGAAGAGCACTTATCTTCAACAAGTGTGTCTCATAGTTATTCTTGCTCAGATTGGTTGCTATGTTCCTGCCCGCTTTGCAACAATTAGAGTAGTTGATCGTATATTTACAAGGATGGGCACAATGGATAATCTTGAATCAAACTCTAGTACGTTTATGACAGAGATGAAAGAGACTGCTTTTGTCATGCAGAATGTCTCCCAAAGGAGTCTGATTGTTATGGATGAACTTGGGAGGGCTACTTCGTCCTCTGATGGATTGGCAATAGCATGGAGCTGCTGTGAACATCTGCTATCACTCACTGCGTATACCATATTTGCTACT
CATATGGAGAACTTGTCAGAATTAGCTACCATCTATCCAAATGTGAAAATTCTTCGCTTCGATGTTGATATTAGAAACAGCCGCCTAGATTTTAAGTTTCAACTCAAGGATGGACCAAGGCATGTAGCACACTATGGCCTTCTACTAGCAGAAGTGGCAGGATTACCGAGTTCGGTGATTGAAACAGCCAGAAGCATAACATCAAGGATTACAGACAAGGAAGTGAAGCGAATGGATGTAAACTGCCTGCACTATAATCAAATACAGTTGGCATATCATGTTTCTCAACGACTGATATGCTTGAAGTACTCCAACCATGACGAGGACTCCATCCGGCAGGCATTGCAAAGTCTCAAAGAGAGCTACATTGATGGTAGGCTCTAA |
Protein: MEDDGGERSSFVIGLIENRAKEVGVAAFDLRSASLHLSQYIETSSSYQNTKTLLHFYDPMMIIVPPNKLAPEGMVGVSELVDRFYASVKKIVMARGCFDDTKGAMLIKNLAVREPSALGLDSYYKQYYLCLASASATIKWIEAEKGVIVTNHSLSVTFNGSFDHMNIDATSVQNLEIIEPFHSALWGTNNKKRSLFHMLKTTKTVGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQLFFGLSQVLRKFPKETDRVLCHFCFKPKKVTNEVLVVENTRKSQMLISSIILLKTALDALPLLSKVLKDAKSFLLANVYKSICENEKYADIRKRIGVVIDEDVLHARVPFVARTQQCFAVKAGIDGLLDIARRSFCDTSEAIHNLANKYREEFKMPNLKLPFNSRQGFYFSIPQKDIQGQLPSKFIQVVKHGNNVHCSTLELASLNVRNKSAAGECYIRTEVCLEALVDTIREDISVLTLLAEVLCLLDMIVNSFSHTISTKPVDRYIRPEFTDDGPLAIDAGRHPILESIHCDFVPNNIFISEASNMVIAMGPNMSGKSTYLQQVCLIVILAQIGCYVPARFATIRVVDRIFTRMGTMDNLESNSSTFMTEMKETAFVMQNVSQRSLIVMDELGRATSSSDGLAIAWSCCEHLLSLTAYTIFATHMENLSELATIYPNVKILRFDVDIRNSRLDFKFQLKDGPRHVAHYGLLLAEVAGLPSSVIETARSITSRITDKEVKRMDVNCLHYNQIQLAYHVSQRLICLKYSNHDEDSIRQALQSLKESYIDGRL |
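
The relationship between the CDS and Protein entries above can be checked by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. `translate` is a hypothetical helper; it strips whitespace and line breaks first, since the CDS may be wrapped across lines, and it stops at the first stop codon.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# laid out in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Hypothetical helper: unknown codons (e.g. containing N) become 'X'.
    """
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nt and last 12 nt of the CDS shown in this record.
print(translate("ATGGAAGACGACGGAGGAGAGAGG"))  # MEDDGGER
print(translate("GGTAGGCTCTAA"))              # GRL
```

Passing the full CDS string to `translate` and comparing the result with the Protein entry is a quick consistency check on the record.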